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Abstract — Motivated by the looming "capacity crunch" in 
fiber-optic networks, information transmission over such systems 
is revisited. Among numerous distortions, inter-channel inter- 
ference in multiuser wavelength-division multiplexing (WDM) 
is identified as the seemingly intractable factor limiting the 
achievable rate at high launch power. However, this distortion 
and similar ones arising from nonlinearity are primarily due to 
the use of methods suited for linear systems, namely WDM and 
linear pulse-train transmission, for the nonlinear optical channel. 
Exploiting the integrability of the nonlinear Schrodinger (NLS) 
equation, a nonlinear frequency-division multiplexing (NFDM) 
scheme is presented, which directly modulates non-interacting 
signal degrees-of-freedom under NLS propagation. The main 
distinction between this and previous methods is that NFDM 
is able to cope with the nonlinearity, and thus, as the the signal 
power or transmission distance is increased, the new method 
does not suffer from the deterministic cross-talk between signal 
components which has degraded the performance of previous 
approaches. In this paper, emphasis is placed on modulation 
of the discrete component of the nonlinear Fourier transform 
of the signal and some simple examples of achievable spectral 
efficiencies are provided. 

Index Terms — Fiber-optic communications, nonlinear Fourier 
transform, Darboux transform, multisoliton transmission. 

I. Introduction 

THIS PAPER is a continuation of Part I 1 1 ] and Pait II 
|T| on data transmission using the nonlinear Fourier 
transform (NFT). [Part I] describes the mathematical tools un- 
derlying this approach to communications. Numerical methods 
for implementing the NFT at the receiver are discussed in 
[Part II]. The aims of this paper are to provide methods for 
implementing the inverse NFT at the transmitter, to discuss 
the influence of noise on the received spectra, and to provide 
some example transmission schemes, which illustrate some of 
the spectral efficiencies achievable by this method. 

The proposed nonlinear frequency-division multiplexing 
(NFDM) scheme can be considered as a generalization of 
orthogonal frequency-division multiplexing (OFDM) to inte- 
grable nonlinear dispersive communication channels ||T]. The 
advantages of NFDM stem from the following; 

1) NFDM removes deterministic inter-channel interference 
(cross-talk) between users of a network sharing the same 
fiber channel; 
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2) NFDM removes deterministic inter-symbol interference 
(ISI) (intra-channel interactions) for each user; 

3) spectral invariants as carriers of data are remark- 
ably stable and noise-robust features of the nonlinear 
Schrodinger (NLS) flow; 

4) with NFDM, information in each channel of interest can 
be conveniently read anywhere in a network indepen- 
dently of the optical path length(s). 

As described in [Part I], the nonlinear Fourier transform of a 
signal with respect to a Lax operator consists of discrete and 
continuous spectral functions, in one-to-one correspondence 
with the signal. In this paper we focus mainly on discrete 
spectrum modulation, which captures a large class of input 
signals of interest (see Section IVII i. For this class of signals, 
the inverse NFT is a map from 2N complex parameters 
(discrete spectral degrees-of-freedom) to an A^-soliton pulse in 
the time domain. This special case corresponds to an optical 
communication system employing multisoliton transmission 
and detection in the focusing regime. 

A physically important integrable channel is the optical fiber 
channel. Despite substantial effort, fiber-optic communications 
using fundamental soli tons {i.e., 1 -soli tons) has faced numer- 
ous challenges in the past decades. This is partly because the 
spectral efficiency of conventional soliton systems is typically 
quite low (p ^ 0.2 bits/s/Hz), but also because on-off keyed 
solitons interact with each other, and in the presence of noise 
the system reach is limited by the Gordon-Haus effect ||3]. 
Although solutions have been suggested to alleviate these 
limitations [3], most current research is focused on the use 
of spectrally-efficient pulse shapes, such as sine and raised- 
cosine pulses, with digital backpropagation at the receiver, e.g., 
[4]. Although these approaches provide a substantial spectral 
efficiency at low to moderate signal-to-noise ratios (SNRs), 
their efficacy saturates after a finite SNR ^ 25 dB where 
p ^ 5 bits/s/Hz. This, as we shall see in Section [III is due 
to the incompatibiUty of the wavelength-division multiplexing 
(WDM) with the flow of the NLS equation, causing severe 
inter-channel interference. 

There is a vast body of literature on solitons in mathematics, 
physics, and engineering; see, e.g., f3], |'5l-|'8l and references 
therein. Classical, path-averaged and dispersion-managed fun- 
damental 1-soUtons are well-studied in fiber optics |I3]. The 
existence of optical 7V-soliton pulses in optical fibers is also 
well known |3|; this previous work is mostly confined to 
the pulse-propagation properties of A^-solitons, is usually 
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limited to small N (e.g., N — 2,3), or focusses on specific 
isolated A^-solitons {e.g., pulses of the form Asech{t)). Signal 
processing problems (e.g., detection and estimation) involving 
soliton signals in the Toda lattice and other models have been 
considered by Singer [8 |. There is also a related work by 
Hasegawa and Nyu, "eigenvalue communication" LL6|, which 
is reviewed in Section IVI-FI after NFDM is explained. 

While a fundamental soliton can be modulated, detected and 
analyzed in the time domain, iV-solitons are best understood 
via their spectrum in the complex plane. In this paper, these 
pulses are obtained by implementing a simplified inverse 
NFT at the transmitter using the Darboux transform and 
are demodulated at the receiver by recovering their spectral 
content using the forward NFT. Since the spectral parameters 
of a multisoliton naturally do not interact with one another 
(at least in the absence of noise), there is potentially a 
great advantage in directly modulating these non-interacting 
degrees-of-freedom. Sending an iV-soliton train for large N 
and detecting it at the receiver — a daunting task in the time 
domain due to the interaction of the individual components — 
can be efficiently accomplished, with the help of the NFT, in 
the nonlinear frequency domain. 

The paper is organized as follows. In Section |II] we re- 
visit the wavelength-division multiplexing method commonly 
used in in optical fiber networks and identify inter-channel 
interference as the capacity bottleneck in this method. This 
section provides further motivation for the NFT approach 
taken here. In Section |III1 we study algorithms for implement- 
ing the inverse nonlinear Fourier transform at the transmitter 
for signals having only a discrete spectrum. Among several 
methods, the Darboux transform is found to provide a suitable 
approach. The first-order statistics of the discrete spectrum in 
the presence of noise are calculated in Section|IV] In SectionlVl 
we calculate some spectral efficiencies achievable using very 
simple NFT examples. Finally, we provide some remarks on 
the use of the NFT method in Section [Vl] and conclude the 
paper in Section IVIII 

II. Origin of Capacity Limits in WDM Optical 
Networks 

Recent studies on the capacity of WDM optical fiber net- 
works suggest that the information rates of such networks 
is ultimately limited by the impacts of the nonlinearity, 
namely inter-channel and intra-channel nonlinear interactions 
O. These distortions have deterministic and signal-dependent 
stochastic components that grow with the input signal power, 
diminishing the achievable rate at high signal-to-noise ratios. 
In these studies, for a class of ring constellations, the achiev- 
able rates of the WDM method increases with input power 
power, V, reaching a peak at a certain critical input power, 
and then asymptotically vanishes as ^ oo (see e.g., [4J and 
references therein). 

In this section we briefly review WDM, the method com- 
monly used to multiplex many channels in practical optical 
fiber systems. We identify the origin of capacity limitations 
in this model and explain that this method and similar ones, 
which are borrowed from linear systems theory, are poorly 



suited for efficient communication over nonlinear optical fiber 
networks. In particular, certain capacity limitations in the prior 
work are an artifact of these methods (notably WDM) and may 
not be fundamental. In subsequent sections, we continue the 
development of the NFDM approach that is able to overcome 
some of these limitations, in a manner that is fundamentally 
compatible with the structure of the nonlinear fiber-optic 
channel. 

A. System Model 

For convenience, we reproduce the system model given 
in [Part I]. We consider a standard single-mode fiber with 
dispersion coefficient (^2, nonlinearity parameter 7 and length 
C. After appropriate normalization (see, e.g., [Part I]), the 
evolution of the slowly-varying part q{t, z) of a narrowband 
signal as a function of retarded time t and distance z is well 
modeled by the stochastic nonlinear Schrodinger equation 

]q,{t, z) = qtt + 2|g(t, z)\^q{t, z) + n(t, z), (1) 

where subscripts denote differentiation and n{t, z) is a ban- 
dlimited white Gaussian noise process, i.e., with 

E {n{t, z)n*{t\ z')} - -^Sw{t - t')5{z - z'\ 

where 8w{x) = 2VFsinc(2T/Fx), P = and To = y^l^. 
The noise spectral density is given by ctq n^^ahv, with 
parameters given in Table U It is assumed that the transmitter 
is bandlimited to W for all < £ < £, and power limited 
to V, i.e., 

1 

E- \q{tM^dt = V. 

Note that the fiber loss is assumed to be perfectly compensated 
by distributed Raman amplification, and hence ([TJ contains no 
loss term. A derivation of ^ and a discussion about sources 
of noise in fiber-optic channels can be found in O, ||9 |. 

Stochastic partial differential equations (PDEs) such as ([T]) 
are usually interpreted via their equivalent integral repre- 
sentations. Integrating a function with unbounded variation 
can be problematic, since, e.g., a Riemann approximation 
11'^^'' g{z)AB{z) w gil){B{z + dz) - B{z)), for some 
/ G [z,z + dz], where B is the Wiener process, would depend 
on the choice of /. The choice of / leads to various inter- 
pretations for a stochastic PDE, notably Ito and Stratonovich 
representations, in which, respectively, I = z and / = z+dz/2. 
Since noise is bandlimited in its temporal component, the 
stochastic PDE ([T]) is essentially a finite-dimensional system 
and there is no difficulty in the rigorous interpretation of ([T]|. 



TABLE I 
Fiber Parameters 





1 


spontaneou.s emission factor 


h 


6.626 X lO-^^'j • s 


Planck's constant 


u 


193.55 THz 


center frequency 


a 


0.046 km~^ 


fiber loss (0.2 dB/km) 


7 


1.27 W-^km"^ 


nonlinearity parameter 


W 


125 GHz 


maximum bandwidth 
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B. Capacity of WDM Optical Fiber Networks 

Fiber-optic communication systems use wavelength-division 
multiplexing to transmit information. Similar to frequency- 
division multiplexing, information is multiplexed in distinct 
wavelengths. This helps to separate the signals of different 
users in a network, when they have to share the same links 
between different nodes. 

Fig. [T] shows the system model of a link in an optical 
fiber network between a source and a destination. There are 
N fiber spans between multiple users at the transmitter (TX) 
and multiple users at the receiver (RX). The signal of some 
of these users is destined to a receiver other than the RX 
shown in FigHI As a result, at the end of each span there is 
a ROADM that may drop the signal of some of the users 
or, if there are unused frequency bands, add the signal of 
potential external users. We are interested in evaluating the 
per degree-of-freedom capacity (bits/s/Hz) of the fiber link 
from the transmitter (TX) to the receiver (RX). 

In WDM, the following signal is transmitted over the 
channel 



q{t,0) 



N-1 /WT 

EE 

fe=0 \l=l 



jk2TTWt 



0<t<T, (2) 



where are the symbols transmitted by user k, <j)e{t) are 
pulse shapes, W > 1/T is the per-user bandwidth, and 
A'^ is the number of WDM channels. For the purposes of 
illustration, we can assume that each user sends an isolated 
pulse in the time interval [0,T]. Thus each user operates at a 
single frequency in a bandwidth W = 1/T and q{t,0) = 
J2k=o 1k(,0)G^p{jk2Trt/T), where {qk{0)} are the Fourier 
series coefficients at z = 0. As this signal evolves along z, we 
have a Fourier series with variable coefficients 



AT-l 



(3) 



k=0 



Substituting the periodic solution (O into ([T]l, we get the 
NLS equation in the discrete frequency domain 



j- 



dz 



ATi'W'k'qk{z) + 2\qk{z)Vqk{z) 



dispersion 

-2qu{z)Y,W{z)\' 



SPM 



t^k 



XPM 



-2 ^ qm{z)qi{z)q*^^i_k{z)+Vk{z), (4) 



l^k 



FWM 

in which are the noise coordinates in frequency and 
where we have identified the dispersion, self-phase modulation 
(SPM), cross-phase modulation (XPM) and four-wave mixing 
(FWM) terms in the frequency domain. 

It is important to note that the optical WDM channel is 
a (nonlinear) multiuser interference channel [|10J . The (inter- 
channel) interference terms are the XPM and FWM. There is 
no ISI in the assumed isolated pulse transmission model (|4|i 



with one degree of freedom per user. However in a pulse-train 
transmission model where WT > 1, replacing {q^} by {s^} 
via the inverse transform shows that the other two effects, the 
dispersion and SPM, cause inter-symbol interference (intra- 
channel interaction). Performance of a WDM transmission 
system depends on how interference and ISI are treated, and 
in particular the availability of the user signals at the receiver. 
Several cases can be considered. 

Assuming a given (perhaps suboptimal) receiver, (|2) is 
discretized as a map from degrees-of-freedom at the input 
to their estimate sj. at the output: 



. . . , S2, S2, 



si a2 
■ • 7 *2i ''21 • ■ 



If joint transmission and detection of all degrees of freedom 
sj, is practical, the channel is essentially a single user channel 
whose capacity is non-decreasing with average input power 



V 



1 



NWT 



N-l WT 

EEEi4r 

fc=0 1 = 1 



If joint transmission and detection is not possible (e.g., in 
frequency k or in time £), then one must adopt the strategy of 
treating interference as noise. In this case, one can examine 
various strategies, e.g., interference alignment ifTTI or signal- 
space orthogonalization. If one of these strategies can be suc- 
cessfully applied, then, again, the capacity is non-decreasing 
with v. If none of these strategies is applicable, or if additional 
constraints are present, the capacity of a WDM system may 
in some cases decrease with V. 

a) Single-user Memoryless Channels: The capacity-cost 
function of a single user discrete memoryless channel (DMC) 
with unbounded input alphabet (real- or complex-valued sym- 
bols with no peak power), is a non-decreasing function of 
a (equality or inequality constrained) linear cost [12fl. This 
follows from the concavity of the mutual information and the 
linearity of the cost as a function of the input distribution (see 
|12i for a detailed discussion about more general scenarios). 
This, of course, holds true for any set of transition probabilities 
(a DMC), including that obtained from nonlinear channels. 

Care should be taken when one has more than one degree- 
of-freedom, if additional constraints are present, or if the 
capacity-cost program is non-convex. For instance, when a 
signal propagates according to a nonlinear evolution equation, 
its spectrum can spread continuously. The amount of spectral 
broadening depends on the pulse shape and, in particular, on 
the signal intensity. In the absence of optical filtering, the 
per degree-of-freedom capacity (bits/s/Hz) of the single-user 
nonlinear channel is expected to be non-decreasing with input 
power (even under the suboptimal backpropagation receiver); 
however, it may asymptotically vanish in the presence of 
optical filtering. Here, by per degree-of-freedom capacity, we 
mean the ratio of the sum rate to the maximum number of 

'Note that this may not hold for finite alphabet channels (with equality 
power constraint), or when constraints Hmit the support of the input dis- 
tribution. This can occur, e.g., in the presence of a peak power constraint, 
or a maximum bandwidth constraint, in a channel — such as (T) — in which 
each input symbol has a power or bandwidth cost. Therefore, in computer 
simulations, one should be wary of peaks in C{'P) which are close to the 
peak power 
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Fig. 1. Fiber-optic communication system (after 0)- 



signal space dimensions occupied by the transmitted signals 
during the entire evolution (capacity per maximum input 
cost — see Section IV-AI and (|29]l). Spectral broadening is an 
important issue in zero-dispersion fibers where the channel is 
strongly nonlinear |.13| . 

b) Single-user Channels with Memory: The SPM term of 
each user is available at the receiver for that user Its determin- 
istic part, if needed, can be removed, e.g., by backpropagation 
and its (signal-dependent) stochastic part can be handled by 
coding and optimal detection over a long block of data (e.g., 
maximum likelihood sequence detection). Deterministic or 
stochastic intra-channel effects do not limit the capacity if such 
joint detection is performed. 

This may not be true for a receiver which performs isolated 
symbol detection. Whereas ISI is a deterministic distortion in 
traditional linear channels, for a nonlinear waveform channel 
ISI can have a stochastic component as well. As a result, unlike 
a waveform additive white Gaussian noise channel where ISI 
is removed completely by channel inversion, in nonlinear 
systems such channel inversion (e.g., by backpropagation) 
leaves a residual stochastic ISI for each symbol. In this case, 
the receiver based on backpropagation and isolated symbol 
detection gives rise to an ISI-limited system with a suboptimal 
performance. 

c) Multi-user Channels with Interference: Inter-channel 
interference in the frequency domain is the dual of intra- 
channel ISI in the time domain; the difference is that coopera- 
tion and joint detection is generally not possible among users. 
In an optical fiber network, many users have to share the same 
optical fiber link. The signals of some of these users join and 
leave the optical link at intermediate points along the fiber, 
leaving behind a residual nonlinear impact. Thus we should 
assume that each user has access only to the signal in its own 
frequency band, and the signal of users k', where k' ^ k, is 
unknown to the fc* user 

In a nonlinear channel where, by definition, additivity is 
not preserved under the action of the channel, multiplexing 
signals in a linear fashion, e.g., by adding them in time or in 
frequency — in reconfigurable optical add-drop multiplexers 
(ROADMs) — leads to inter-channel interference. The interfer- 
ence resulting from these signals is unavoidably treated as 
noise and ultimately limits the achievable rates of such optical 
fiber systems. 

Assuming that the coefficients qk{z) remain independent 
in (|4|i during the evolution, the XPM term is statistically 



regulated, so that as iV — > oo, 

l^k 

Thus XPM does not influence the capacity of optical fibers 
significantly. 

FWM, on the other hand, is stronger (cubic in the signal 
amplitude) and is not averaged out like XPM is [14|. It is 
obvious that this interference will blow up when increasing the 
common average power V (N ^ 1), ultimately overwhelming 
the signal and limiting the achievable rate. The per-degree-of- 
freedom achievable rates of the WDM method versus power, 
liV), is noise-limited in the low SNR regime, following 
log(l + SNR), and interference-limited in the high SNR 
regime, decreasing to zero ||4], lfT4l . ifTSl : see Fig. |3] 

To summarize the preceding discussion, the transmission 
rates achievable over the nonlinear Schrodinger channel de- 
pend on the method of transmission and detection, as well as 
the assumptions on the model. One can assume a single-user or 
a multiple-user channel, with or without memory, and with or 
without optical filtering. It is thus important, when comparing 
different results, to clarify which modelling assumptions have 
been made. 

C. Inter-channel Interference as the Capacity Bottleneck 

It follows that while intra-channel interference can be han- 
dled by signal processing, inter-channel interference ultimately 
limits the achievable rate of optical fiber networks. The current 
practice in fiber-optic communications is to send a linear sum 
of signals in time {i.e., a pulse train) and in frequency {i.e., 
WDM) in the form of dU, both of which are not particu- 
larly well suited for the nonlinear fiber channel. Thus we 
identify ROADMs as the main culprit for capacity limitation 
in currently deployed optical fiber networks. A modification 
of these devices is needed so that the incoming signals are 
multiplexed in a nonlinear fashion, exciting non-interacting 
degrees-of-freedom of the NLS equation. 

To illustrate the effect of inter-channel interference in the 
application of the WDM method to the nonlinear fiber channel, 
we have simulated transmission of 5 WDM channels over 
2000 km of standard single mode fiber with parameters from 
Table U At the end of each span, a ROADM filters the central 
channel of interest (COI) at 40 GHz bandwidth, and adds 
four independent side-band signals randomly from a common 
constellation. At the channel output, the COI is filtered and 
backpropagated according to the inverse NLS equation. Fig. |2] 
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Fig. 2. (a) 5 WDM channels in the frequency, with the channel of interest (COI) at the center Neighbor channels are dropped and added at the end of 
the each span, creating a leftover interference for the COI. (b) Channel of interest at the input (dotted rectangle) and at the output after backpropagation 
(sohd curve). The mismatch is due to the fact that the backpropagation is performed only on the channel of interest and the interference signals cannot be 
backpropagated. (c) Inter-channel interference is increased with signal intensity. 
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Achievable rates of the WDM method under weak and strong inter- 
interference. 



SO that both the deterministic inter-channel and inter-symbol 
interferences are simultaneously zero for all users of a mul- 
tiuser network. Such interference cancellation follows directly 
from the integrability of the cubic nonlinear Schrodinger 
equation in 1 + 1 dimensions, and is generally not feasible for 
other types of nonlinearity (even if the nonlinearity is weaker 
than cubic!). The achievable rates of the suggested scheme are 
expected to be higher than those of the WDM method and the 
scheme has numerous other potential advantages, mentioned 
in Section U 

Remark 1. Note that for an information-theoretic study, it is 
not necessary to perform deterministic signal processing such 
as backpropagation. One is only concerned with transition 
probabilities, which include effects such as rotations or other 
deterministic transformations. Backpropagation just aids the 
system engineer to simplify the task of the signal recovery 
(being an invertible operation, it does not change the informa- 
tion content of the received signal). 



compares the input and output frequency-domain waveforms, 
after backpropagation of the COI. Fig.|2la) shows five random 
instances of the multiplexed signals; note that, because out-of- 
band signals are filtered and replaced at various points along 
the fiber, the out-of-band signals at the receiver are not related 
to the transmitted ones. Only the COI is backpropagated. 
Comparing Fig. |2c) with Fig. EJb), it can be seen that 
the nonlinear inter-channel interference is stronger at higher 
powers. 

The estimated information rate of this system is shown 
in Fig. [3] It is clear from both Fig. |2] and Fig. |3] that, as 
the average transmitted power is increased, the signal-to-noise 
ratio in the COI, and as a result the information rate, vanishes 
to zero. Note that this effect can also be predicted by a simple 
SNR analysis at the receiver 

Using the mathematical and numerical tools described in 
Parts I and II, this paper aims to show that it is possible to 
exploit the integrability of the nonlinear Schrodinger equation 
and induce a /c-user interference channel on the NLS equation 



III. The Discrete Spectral Function 
A. Background 

Here we briefly recall the definition of the discrete spectral 
function in the context of the nonlinear Schrodinger equation. 
We first consider the deterministic version of (H), where the 
noise is zero. Later, we will treat noise as a perturbation of 
the noise-free equation. 

The nonlinear Fourier transform of a signal in dl) arises via 
spectral analysis of the operator 

^ = ^(-|(t) lf)=^-(^^3 + Q), (5) 
where D — 

Let v{t, A) be an eigenvector of L with eigenvalue A. Fol- 
lowing [Part I], the discrete spectral function of the signal 
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propagating according to ([1} is obtained by solving the the equations 

At;, or equivalently 



Zakharov-Shabat eigenproblem Lv 

-jA q{t) 



Vt 



-q*{t) J A 



-oo, A) 



-j\t 



(6) 



where the initial condition was chosen based on the assump- 
tion that the signal q{t) vanishes as \t\ — oo. The system of 
ordinary differential equations (|6} is solved from < = — oo 
to t = +00 to obtain v{+oo, A). The nonlinear Fourier 
coefficients a(A) and b{X) are then defined as 



6(A) = lim V2{t,\)e'^^*- 



Finally, the discrete spectral function is defined on the upper 
half complex plane C+ = {A : 3(A) > 0}: 



9(A,) 



where prime ' denotes differentiation and Xj are the isolated 
zeros of a(A) in C+, i.e., solutions of a{Xj) = 0. The 
continuous spectral function is defined on the real axis A G R 
as g(A) = 6(A)/a(A). 



B. Modulating the Discrete Spectrum 

Let the nonlinear Fourier transform of the signal q{t) be 
represented by q{t) < — > {q{X),q{Xj)). When the continuous 
spectrum q{X) is set to zero, the nonlinear Fourier transform 
consists only of discrete spectral functions q{Xj), i.e., N com- 
plex numbers Ai, . . . , Aat in C+ together with the correspond- 
ing N complex spectral amplitudes q{Xi), . . . , q{XN). In this 
case, the inverse nonlinear Fourier transform can be worked 
out in closed-form, giving rise to A^-soliton pulses |17|. The 
simplified expressions, however, quickly get complicated when 
N > 2, and tend to be limited to low-order solitons. 

One can, however, create and modulate these multisolitons 
numerically. In this section we study various schemes for the 
implementation of the inverse NFT at the transmitter when 
q = 0. 

1 ) Discrete Spectrum Modulation by Solving the Riemann- 
Hilbert System: The inverse nonlinear Fourier transform can 
be obtained by solving a Riemann-Hilbert system of integro- 
algebraic equations or, alternatively, by solving the Gelfand- 
Levitan-Marchenko integral equations. Great simplifications 
occur when q{X) is zero. For instance, in this case the integral 
terms in the Riemann-Hilbert system vanish and the integro- 
algebraic system of equations is reduced to an algebraic linear 
system, whose solution gives rise to A^-soliton pulses. 

Let V{t,Xj) and V{t,X*) denote the scaled eigenvectors 
associated with Aj and A* defined by their boundary conditions 
at +00 (they are denoted by and in [Part I]). Setting 
the continuous spectral function q{X) to zero in the Riemann- 
Hilbert system of [Part I], we obtain an algebraic system of 



V{t,X*m) 



V{t, K 



q{X,)e^^^''V{t,X,) 



E 

1=1 



N 

E 



A* J — Xi 
q*{X,)e-^^^'i*Vit,X*) 



Xr 



X* 



(7) 



Define 



V2xAf 
V2xAf 



KatxTV 

(Jl)2xAf 
(J2)2x7V 

32xN - 



{V{t,X,) 


V{t,X2) ■■ 


V{t,XN)), 


{V{t,X*i) 


V{t,X*2) ■■ 


Vit,X*N)), 


A*-Ai 
A* -As 


Aj-Ai 

2iAot 

A*-A2 


A^-Ai 
Aj^ — A2 


\ A*— Ajv 


gjve « 
As — Ajv 


A^ — Atv 



1 1 





1 1 

fl 1 



JiK* 



F,vxi = (gie^J^i* 






1)^ 



e 

2jA2t , 



qNG 



'Nxl = yqid--'- 926 

In this notation, the algebraic equations dTjl are simplified to 

V = Ji+VK V = J2-VK*. 

Note that K* is the complex conjugate of K (not Hermitian). 
Therefore 

V = (J2 - JiK*) {In + KK*)"' = J (Iw + KK*)~' , 

T c^, , KK*)"\ The iV-soliton formula IS given 



and V2 = e-' (Ijv H 
by 

q{t) 



-2je^ (Ijv 



(8) 



The right hand side is a complex scalar and has to be evaluated 
for every t to determine samples of q{t) everywhere. 

Example 1. It is useful to see that the (scaled) eigen- 

2 

is 



vector for a single soliton with spectrum ^ 2) 



v{t,X;z) = 



1 / e"^* 

- sech[w(i - to)] [^(t-to) 



where $ = at + (a^ - uj^)z - Zgo - f and tg = ^ log - 
2az. The celebrated equation for the single soliton obtained 
from (|8]l is 



j{t) = - jwe^^"*e"^'^"sech (u;(t ± to)) 



(9) 



From the phase-symmetry of the NLS equation, the factor —j 
in (|9]l can be dropped. The real and imaginary part of the 
eigenvalue are the frequency and amplitude of the soliton. Note 
that the discrete spectral amplitude q{X) is responsible for the 
phase and time-center of the soliton. 
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Unfortunately the Riemann-Hilbert system is found to be 
occasionally ill-conditioned for large and, in the current 
form, is not the best method for numerical generation of iV- 
solitons. 

2) Discrete Spectrum Modulation via the Hirota Bilin- 
earization Scheme: It is also possible to generate multisolitons 
without solving a Riemann-Hilbert system or directly using the 
NFT. A method which is particularly analytically insightful is 
the Hirota direct method 1 17 1. It prescribes, in some sense, a 
nonlinear superposition for integrable equations. 

The Hirota method for an integrable equation works by 
introducing a transformation of the dependent variable q to 
convert the original nonlinear equation to one or more homoge- 
neous bilinear PDEs. For integrable equations, the nonlinearity 
usually is canceled or separated out. The resulting bilinear 
equations have solutions that can be expressed as a sum of 
exponentials. Computationally, bilinear equations are solved 
perturbatively by expanding the unknowns in terms of the 
powers of a small parameter e. For integrable equations, this 
series truncates, rendering approximate solutions of various 
orders to be indeed exact. The bilinearization transformation 
has been found for many integrable equations ifTTl , taking 
on similar forms that usually involve the derivatives of the 
logarithm of the transformed variable. 

Let us substitute q{t,z) = G{t, z) / F{t, z), where, without 
loss of generality, we may assume that F{t, z) is real-valued. 
To keep track of the effect of nonlinearity, let us restore the 
nonlinearity parameter 7 in the NLS equation. Plugging q — 
G/F into the NLS equation 



we get 



jlz qtt + 27|gpg, 



J {G,F - FG,) = FGtt - 2FtGt - GFu 



G 



F 

= FGtt — '^FtGt + GFtt 

+ 2^^±«^G, (10) 
F 

where we have added and subtracted 2GFtt- Equation ( fTOl i is 
trilinear in F and G. It can be made bilinear by setting 



3{G,F - GF,) = FGtt ~ 2FtGt + GFtt, 
F^+j\Gf-FFtt^O. 



(11) 



It is very convenient (though not necessary) to organize (ITTl i 
using the Hirota D operator 



resulting in 



{jD, + Dl)FG = 0, 

DfFF = 2710^. 



(12) 
(13) 



Note that the D-operator acts on a pair of functions to 
produce another function. Note further that (ITZt does not 
depend on the nonlinearity parameter 7. That is to say, 
the nonlinearity has been separated from equation (fTZt . For 



some other integrable equations {e.g., the Korteweg-de Vries 
equation) for which one gets only one bilinear PDE, the 
nonlinearity parameter is in fact canceled completely. 

Bilinear Hirota equations (fT2] |-(fT3]l have solutions in the 
form of sum of exponentials. As shown in Appendix |A] F 
and G are obtained as [17], [18]: 



F{t,z)= 5i(b)exp(b^x + b^Rb) 

b=0,l 

G(t,z)= (52(b) exp (b'^x + b^Rb) 



b=0,l 



where b = is a binary vector, b, = {0, 1}, x = [Xi]f^-^^, 

]2N 



R = is the Riemann spectrum 



i>3 

'2\og{uj,+Uj) 1^1,2,. ..,N-j = N + l,...,2N 
'2\og{uji -LOj) N + l,...,2N;j N + 1,...,2N 



and 



N N 

otherwise 

{N N 
1 Eb^^^+Eb^+N 
i=l i=l 
otherwise 

Note that, using the identity 9tt log F — " ^2 — U2l is 
reduced to \q{t, z)p — -f^^dtt logi^. Therefore the amplitude 
of q is captured by the real-valued function F, while G 
contains the phase of the signal. 

Two important observations follow from the Hirota method. 
Firstly, multisoliton solutions of the NLS equation in the 
F,G domain (q = G/F) are the summation of exponentially 
decaying functions e~"*e^"*, each located at a frequency u. 
That is to say, while plane waves e^*^'^*^'^^^ are the natural 
basis functions that solve linear PDEs, for integrable systems, 
exponentially decaying functions are suitable. The addition 
of the decaying factor e^"* (when t > 0) is the point at 
which the nonlinear Fourier transform diverges from the linear 
Fourier transform [18j. Secondly, for each individual soli ton 
term, the Hirota method adds two-way interaction terms, 
three-way interaction terms, etc., until all the interactions are 
accounted for In this way, the interference between individual 
components is removed, as shown schematically in Fig. |4] 
Tables HI] and HI] show these interaction terms for = 1, 2, 3. 

While the Hirota method reveals important facts about 
signal degrees-of-freedom in the NLS equation, it may not be 
the best method to compute multisolitons numerically. There 
are {^^) ~ 2^^ and Q^^) ~ 2^^ terms in F and G 
respectively, and unless one truncates the interaction terms at 
some step, the complexity quickly grows, making it hard to 
compute A^-solitons for N > 10. 

3) Recursive Discrete Spectrum Modulation Using Darboux 
Transformation: Multisoliton solutions of the NLS equation 
can be constructed recursively using the Darboux transforma- 
tion (DT). The Darboux transformation, originally introduced 
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{bi}- 

{h}- 

{bN} 



bk to flfe 
mapper 



a{-ai+juji)t 



a{-a.2+juJ2)t 



G 

interaction 
terms 



F 

interaction 
terms 



6 



q = G/F 



Fig. 4. Hirota modulator for creating Af-solitons. 



TABLE II 

The Structure of the Interaction Terms in F and G. 





F 


N =1 


1 + e-^i+^i* 


2 


1 + e^i 


_|_ ^^2+^2 -f e^i+^2 -f e^2+^r 


3 


^f,Xi+X2+x^-\-X2 f,Xi+X2+x^+x^ -\- 

__^^X2+X3,+X'^-\-X^ qX2+X 


'eXi+X2 gX2+xl _j_ gXi+x| _j_ gX3+xj" ^ gX2+Jf| ^ gXa+xJ^ _j_ 

I+X2 + X2+X3 _j_ gXi+X3 + X^ _^ ^Xi+X^+X^ +X^ ^ g-'^l+-'^3+-'^2 "'""'^3 
3 + X'+X2* gX2+X3 + X2*+XJ^ _,_ gXi+X2+X3+X*+X2*+X| 




G 


N = 1 




2 




_ g^l gXi+X2+X^ _j_ gXi+X2+X2 


3 


^X2+X^+Xl _^ qX2+X^+X2 j_ ^X2+X^+X^^ 


+X2+X2 ^ gXi+X2+X3 _|_ gXi+X3+X^ gXi+X3+X| ^ gXi+Xs+Xg 

+ (^e^i+X2+X3+xi+x^ _j_ gXi+X2+X3+xJ+X3* ^ gXi+X2+X3+xJ+x|^ 



V (t, \k+i;q{t: Ai, ■ ■ ■ , A^)) 

(a) 



(?{i;Ai,---,Afc+i) 



v{t,fi-,q{t;Xi,--- ,Xk)) ■ 

v{t,Xk+i;q{t;Xi, - ■ ■ ,Xk)) ■ 



(b) 



Fig. 5. Updates in tlie Darboux transformation: (a) signal update; (b) eigenvector update. 



in the context of the Sturm-Liouville differential equations 
and later used in nonlinear integrable systems, provides the 
possibility to construct a solution of an integrable equation 
from another solution lfT9l . For instance, one can start from 
the trivial solution g = of the NLS equation, and recursively 
obtain all higher order A^-soliton solutions. This approach is 
particularly well suited for numerical implementation. 
Let x{t, A; q) denote a solution of the system 

xt = P{C,q)x, 

x,=M{C,q)x, (14) 



for the signal q and complex number C = A (not necessarily an 
eigenvalue of q), where the P and M are 2 x 2 matrix operators 
defined in [Part 1]. It is clear that tl satisfies 

(fl4l i for C, ^ C,*, and furthermore, by cross-elimination, q is 
a solution of the integrable equation underlying (fT4t . 

The Darboux theorem is stated as follows. 

Theorem 1 (Darboux Transformation). Let <j){t, A; q) be a 
known solution of (114b . and set E = STS^^, where S = 
A), A)] and T — diag{\,\*). If v{t,fj,;q) satisfies 
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g{t) 

v(t,\ 
?;(i,A2;0) 
A3;0) 

?;(i.Aiv_2;0) 
?;(i.Aiv_i;0) 
«(i,Ajv;0) 



= 0^ 
i;0) -AjE) 



U(E> 



g(t;Ai) 

■ y(i,A2;g(t;Ai)) 
■i,(i,A3;g(t;Ai)) 
■i<(i.A4;(i(t;Ai)) 



■i'(t,Ajv_i ;(/(*; Ai)) 
— ■»(*, Aiv;(j(t; Ai)) 




Ai, Aa) — 

■u(t, A:j;(}(t; Ai, A2)) -f*(E; 
■y(t,A4;9(i;Ai,A2)) 
■y(t,A5;(/(i;Ai,A2)) 




q{t; Ai, . . . , Ajv_i) 
D(t, Ajv;(j(t;Ai,...,Ajv_i)) 



(j(t;Ai,...,Ajv) 



Fig. 6. Darboux iterations for tlie construction of an A^-soIiton. 



( I14l l, f/ien u{t,^]q) obtained from the Darboux transform 

u(t,t^;q) ^ -^)v{t,ii;q), (15) 
satisfies ( fT4] l we/Z, /or 

Furthermore, both q and q satisfy the integrable equation 
underlying the system (1141) . 

Proof: See Appendix IbI ■ 
Theorem [T| immediately provides the following observa- 
tions. 

1) From <j>{t,\;q) and v{t,fi;q), we can obtain u{t,fi;q) 
according to ( fTSl l. If /i is an eigenvalue of q, then /.i is 
an eigenvalue of q as well. Furthermore, since u{t, /i = 
A; = 0, A is also an eigenvalue of q. It follows that 
the eigenvalues of q are the eigenvalues of q together 
with A. 

2) g is a new solution of the equation underlying ( fT4] i. 
obtained from q according to ( fTSl l, and u{t, /i; q) is one 
of its eigenvectors. 

These observations suggest a two-step iterative algorithm 
to generate A^-solitons, as illustrated in the Figs. |5]-|6] De- 
note a fc-soliton pulse with eigenvalues Ai,A2,...,Afc by 
q{t; Xi, X2, ■ ■ . , \k) The update equations for the 

recursive Darboux method are given in Table HII] Note that 
v{t, \j , q^'^^^'f ) can also be obtained directly by solving the 
Zakharov-Shabat system ^ for q'^'^^^K It is however more 
efficient to update the required eigenvector according to Ta- 
ble Uni The algorithm is initialized from the trivial solution 
q — Q. The initial eigenvectors in Fig. |6] are chosen to be the 
(non-canonical) eigenvectors v{t, \j) — [Aje^^^^*, Bje^^^^'^ . 
The coefficients Aj and Bj control the spectral amplitudes and 
the shape of the pulses. For a single soli ton, Aj — exp{jZq) 
and Bj — \q\. 

Remark 2. It can be observed numerically that the discrete 
spectrum of a long random bandlimited wavetrain, e.g., that 
formed from a sine basis, appears to contain most of the 
wavetrain energy. This implies that a realization of a stationary 
bandlimited stochastic process x{t), t E [—T/2, T/2], is likely 
to be a multisoliton as T 00. In other words, a large class 



of input signals of interest are indeed multisolitons (though 
they may not look so in appearance). 

C. Evolution of the Discrete Spectrum 

Recall that the imaginary and real parts of the eigenvalues 
correspond, respectively, to soliton amplitude (energy) and 
frequency. If the discrete spectrum of the signal lies completely 
on the imaginary axis, the A^-soliton does not travel while 
propagating (with respect to a traveling observer). The indi- 
vidual components of an A^-soliton pulse with frequencies Ai 
off the juj axis travel in retarded time with speeds proportional 
to 3fiAi (frequency). 

The manner of 7V-soliton propagation thus depends on the 
choice of the eigenvalues. An A^-soliton signal is essentially 
composed of N single solitons coupled together, similar to a 
molecule which groups a number of atoms. If the eigenvalues 
have non-zero distinct real parts, various components travel at 
different speeds and eventually when z — J- 00 the A^-soliton 
decomposes into N separate solitons 

N 

q{t, z)^Y^ ^.g-j(a.t+fe,2+0o) sech{uj,{t -U- a,z)), 
1=1 

where A^ = {ai + juJi)/2 are eigenvalues, ki — {of —ujf)/2, 
and ti is the time center. This breakdown of a signal to its 
individual components, while best observed in the case of 
multisolitons, is simply a result of group velocity dispersion 
and exists for all pulses similarly (including sine functions). 
The extent of breakdown and shift depends on a variety 
of factors, such the length of the fiber, number of mass 
points, fiber dispersion and dispersion-management schemes. 
If dispersion is not managed, the effects of pulse broadening 
must be carefully considered. 

D. Demodulating the Discrete Spectrum 

To demodulate a multisoliton pulse, the eigenproblem ^ 
needs to be solved. There is limited work in the mathematical 
literature concerning the numerical solution of the Zakharov- 
Shabat spectral problem (|6j. In Part [II], we have studied 
methods by which the nonlinear Fourier transform of a signal 
may be computed numerically. In particular, in this paper we 
use the layer-peeling and Ablowitz-Ladik methods described 
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TABLE III 

Update Equations for the Recursive Darboux Method 



Eigenvector update: 

vi{t,\,,q(>'+-')) = -{(A, - Afc+i)|i,i(t,Afc+i,gW)p + (Aj - A*_^^)|i,2(t,Afc+i,gW)|2},,i(t,Aj,g('=)) 
+{A^+i - Afc+i)«;i{t,Afe+i,qW)^'{t,Afe+i,g('=))«2(i,Aj,qW), 

«2(t,A,,g('=+i)) = -{(A, -A*_^,)|i,i(i,Afe+i,gW)p + (A, -A,+i)|i,2(t,A,+i,gW)|2}t,2(<,A,^ 
-(Afc+i-A*_^jK{t,Afe+i,gW)^2{t,Afc+i,gWK(i,A,,q('=)), 

for k = 1, . . . , N and j = k, . . . , N . 
Signal update: 



in Part [II] to estimate the discrete spectrum. The reader is 
referred to [Part II] for the details. 

IV. Statistics of the Spectral Data 

In this section we generalize the deterministic model con- 
sidered so far to include the effects of amplified spontaneous 
emission (ASE) noise during signal propagation. We present 
a method to approximate the statistics of the spectral data at 
the receiver 

Remark 3. In Section III-BI we identified inter-channel in- 
terference in multiuser WDM networks as the intractable 
factor limiting the achievable rates of the current methods at 
high launch powers. In comparison with the effect of inter- 
channel interference on achievable rate, noise is of secondary 
importance, and hence we do not intend to provide here a 
comprehensive analysis. 

The generalized NLS equation that includes an additive 
space-time noise term is generally not integrable0 Further- 
more, the addition of noise disturbs the vanishing or periodic 
boundary conditions usually assumed in the development of 
the nonlinear Fourier transform. One may therefore question 
whether the NFT is in fact well defined in this case. Fortu- 
nately, since the ASE noise power in optical fibers is quite 
small compared to the signal power for SNR > 10 dB, one 
can treat the noise as a small perturbation, and still safely use 
the NFT. 

Calculation of the exact statistics of the spectral data at the 
receiver can be quite cumbersome. This is essentially because 
the NLS equation with additive noise, unlike the noise-free 
equation, has little or no structure, giving rise to complicated 
variational representations for the noise statistics. Even if exact 
expressions could be obtained, it is unlikely that they would 
be suitably tractable for data communications studies. One 
can, however, approximate these statistics using a perturbation 
theory, or simulate them on a computer In this paper we follow 
a perturbation theory approach. 

Remark 4. Note that in this paper, we have not included the 
effects of fiber loss in our model. This assumption is justified 
in systems using distributed (Raman) ampUfication (DRA), 

^However, in some special cases involving real-valued multiplicative noise, 
the generalized NLS equation may still be integrable [20] and the NFT can 
be used without approximation. 



which essentially trades loss for noise, which is treated in 
this section. 

Noise can be introduced in a lumped or distributed fashion. 
The former case arises in systems using erbium-doped fiber 
amplifiers located at the end of each fiber span. In this case, 
the randomness is on initial conditions at the input of each 
fiber span, and the NFT can be used without approximation. 
We refer to this type of noise (where the randomness comes 
from the initial conditions and not the medium) as lumped 
noise. 

If noise is injected continuously throughout the fiber as 
a result of DRA, one has distributed noise. Here we can 
discretize the fiber into a large number of small fiber segments 
and add lumped noise at the end of each segment. Each such 
injection of noise acts as a random perturbation of the initial 
data at the input of the next segment. The DRA can thus be 
approximately treated similar to the lumped noise case. Indeed, 
when the stochastic NLS equation is simulated numerically, 
the step size in the split-step Fourier method can be as large as 
the distance between erbium-doped fiber amplifiers (EDFAs), 
i.e., the dynamics of the pulse propagation under DRA and a 
large number of EDFAs is about the same. 

In this section, we study the effect of the these two types 
of perturbations on the NFDM channel model. We assume 
that the noise vanishes, or is negligible, as \t\ co and has a 
finite energy such that the signal remains absolutely integrable 
almost surely. 

A. Perturbation of Eigenvalues 

1 ) Lumped Noise: The NFT arises in the spectral analysis 
of the L operator (|5). We can easily analyze the perturbations 
of the eigenvalues A as a result of the changes in the signal 

q{t). 

Let us denote the nonlinear Fourier transform of q{t) in 
the absence of noise by {q{X),q{Xj)). As the signal q{t) at 
the input of each small segment is perturbed to q{t) + en{t) 
for some small parameter e and (normalized) noise process 
n{t), the (discrete) eigenvalues and spectral amplitudes deviate 
slightly from their nominal values. Separating the signal and 
noise terms, the perturbed v and A satisfy 

{L + eR)v = Xv, R= f , (17) 
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where R is the matrix containing the noise. The study of 
the nonlinear Fourier transform in the presence of (small) 
input noise is thus a perturbation theory of the non-self-adjoint 
operator L + eR. 

Perturbation theory of Hermitian operators is well-studied 
(e.g., in quantum mechanics). The Zakharov-Shabat operator 
in ( fTTl ) is however non- self- adjoint. Unfortunately most useful 
properties of self-adjoint operators (in particular, the existence 
of a complete orthonormal basis from eigenvectors) do not 
carry over to non-self-adjoint operators. For either type of 
operator, deterministic perturbation analysis already exists 
in the literature [2r|-[24]. These results, however, are non- 
stochastic and the distribution of the scattering data is still 
lacking. A very interesting work is [25] in which authors 
calculate the distribution of the spectral data for the special 
case in which the channel is noise-free and the input is a 
white Gaussian stochastic process. There is also numerous 
work pertaining to the statistics of the parameters of a single 
soliton; see e.g., ll26l and references therein. 

For the non-self adjoint operators L, the orthogonality that 
we require is between the space of left and right eigenvectors 
of L associated with distinct eigenvalues; that is to say, 
between eigenvectors of L associated with A and eigenvectors 
of the adjoint operator L* associated with /i ^ A*. Let us equip 
the space of eigenvectors with the usual inner product 



{u,v) 



{uivl + U2V2) dt. 



It can be verified that the operator S3L is self-adjoint, i.e., 

{u, Y,^Lv) ~ {Yi^Lu, v), where S3 = diag(l, —1) is the Pauli 
matrix. 

We use a small noise approximation, expanding unknown 
variables in noise level e as 



v{t) ^ v^°\t) + ev 



A = A(°)+eA(i)+e2A(2) + 



(18) 
(19) 



We assume these variables are analytic functions of e so that 
the above series are convergent. Plugging (IT¥i-(IT9^ into (ITtI i 
and equating like powers of e, we obtain 



(L - A("))z;(2) = -{R - A(i))t;(i) + A^^)^^, 



(20) 



The first term implies that v'^^'^ and A'^"^ are eigenvalue and 
eigenvector of the (nominal) operator L. To eliminate vi from 
the second equation, we take the inner product on both sides 
of (I20I 1 with some vector u; the left hand side of the resulting 
expression is 

(w,(L-A("))«(i)) = ((L-A(°))*^i,z;(i)) 

= ((L* - A(*')*)m,w(i)). (21) 

To have the right-hand side of (l2Tl i vanish, we can choose 11 to 
be an eigenvector of the adjoint operator L* associated with an 
eigenvalue ^x = A^")*, i.e., {L* - A(")*)m = 0. Since L*{q) = 



L{—q), if Lv = A^^^w, it can be verified that L*u — X^^^u for 
u = [vi,-V2] = S3W. Setting u = = S3W^°'(*> ^*) 

(u(o),w(o)) ■ 

Using similar calculations we obtain A*-^-* 
A(2) 



and so on. 

To summarize, the fluctuations of a discrete eigenvalue A„ 
is given by 

(Ur, , Rv„) „, 

Xn^Xn + e^ V + Oe', n^l,2,...,N, 

{Un,Vn) 

where " denotes the eigenvalue after noise addition. It follows 
that the perturbation of the eigenvalues is distributed, to 
the first order, according to a zero-mean complex Gaussian 
distribution. 

Continuing this approach to find higher-order fluctuations 
of eigenvectors, v^'^K fc > 1, is not straightforward because 
the underlying operator is not self-adjoint. 

2) Distributed Noise: Consider now the perturbed NLS 
equation 

jqz ^ qtt + '2\qfq + <^n{t,z), (22) 

where e is a small parameter (noise level), and n{t, z) repre- 
sents the combined effects of the signal loss and the distributed 
noise. 

Let us represent (l22t with the same L and AI of the noise- 
free equation and now let A vary with z. The equality of mixed 
derivatives Vtz — v^t gives 

This, upon re-arranging and using (l22t . simplifies to 



v = 0. 



XzV = eRv, R 



-R. 



Note that, as before, we do not have v{t, z) a priori because, 
according to (|5]l, it depends on the noisy signal q{t^ z) and 
X{z), both of which are unknown. However, if the noise level 
e is small, we can expand v{t, z) and A in powers of e as 
([H and (Hg to obtain (A(°))^ = and iJu^o) = (A^i))^^^^) 
{i.e., (A^^'')2 appears as a time-independent eigenvalue of 
R{t)). Taking the inner-product with u'^'^^ — Esw^^' (i, A*) on 
both sides of _Rt;(") = (A'^-')^u'''^', we obtain the first-order 
variation of eigenvalues 



(Ai). = 



(ii(o),t;(o)) 



(23) 



It follows from (|23] | that the distribution of the deviation 
of the eigenvalues is approximately a zero-mean conditionally 
Gaussian random variable. The variance of this random vari- 
able is signal dependent, and although eigenvectors of an A^- 
soliton can be represented as a series from Darboux transform, 
it is best calculated numerically if > 2. 

Example 2. Consider the single soliton of Example[T] It can be 
verified that {Y.zv{t,X*),v{t,X)) = |(1 + ^) = ^, where 
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we assumed |g| — w so that the soliton is centered at to = 0. 
Furthermore, 

{v{t,X*),Rv{t,X)) 

'(..(t,A*K(t,AK + ..(M*K(M)n)dt 

i sech(a;t) {v2{t, A*) + v;{t, A)) mdt 
+j J i scch{ujt){v2it, A*) - v;{t, X))^ndt 
= — - J scch{ujt)dindt — ^ J sech(ciji)tanh(cji)3fidt, 

where n = nexp(j$) has the same statistics as n. It follows 
that 



This is a conditionally Gaussian random variable (for each A) 
with variance 

/>oo 



az = - I sech(r)5RndT, (24) 
ujz = — J sech(T)tanh(T)3fidT. (25) 

The Gordon-Haus effect easily follows from the equation. 
Note that in fiber optics noise is added to the signal, i.e., n in 
this subsection should be replaced with jn. 

B. Perturbation of Spectral Amplitudes 

Using a similar perturbation approach, we can study the 
influence of noise on spectral amplitudes as well. 

As reported in [Part II], discrete spectral amplitudes are 
chaotic even when noise is zero. We thus here consider first- 
order fluctuation of continuous spectral amplitudes, under the 
lumped noise model. 

Continuous spectral amplitudes are obtained by solving the 
following Riccati equation [Part I] 

d2/(i, A) 



where 



+ {q{t) + en{t)) y'{t, A) + (q* (t) + en*{t)) = 0, 

2;(-oo,A) = 0,(26) 



q{t) = exp(2jAt) and q{X) = lim e^^^^y(t,X). 

One can write a Fokker-Planck equation for y{t,X) in 
||T3l . ||27l . If the signal q{t) is known, e.g., q{t) = as in 
the resulting equation can be solved. In general, however, we 
can expand 

y{t, A) = yo{t, A) + eyi(t, A) + e^y2{t, A) + • • • , 
p(A) = po(A) + epi(A) + eV(A) + • • • , 

and equate Uke powers of e. We obtain that po{X) is the 
spectral amplitude when noise is zero. Define 



G(t,A)= J qiT,X)yo{T,X)dT, 

— OO 

and h{t, A) = n{t)yl(t, A) + n^{t). Then 



where we assumed that noise is delta-correlated. If for some 
signals, E\pi{X)\'^ is unbounded, the above perturbation ex- 
pansion fails and a slow-scale variable T = et needs to be 
introduced. 

V. Some Achievable Spectral Efficiencies Using 

NFT 

We turn now to a numerical study of iV-soliton data 
transmission schemes, providing simulation results to quantify 
the gains that are achievable. 

We start off with classical on-off keying soliton transmis- 
sion, in which, in any symbol period Tg, either zero or a 
fundamental soliton is sent. To improve upon the classical 
binary soliton transmission system, we consider an {N > 2)- 
soliton system, occupying the same time interval as the on-off 
keyed soliton system, and maintaining the same bandwidth 
requirements. The location of the eigenvalues and the values 
of the discrete spectral amplitudes can be jointly modulated for 
this purpose. We shall see that the effective useful region in the 
upper-half plane to exploit the potential of discrete eigenvalue 
modulation is limited by a variety of factors. In this paper, 
we will provide only a brief discussion of the possibility of 
modulating the continuous spectrum. 

Modulating the nonlinear spectrum generates pulses with 
variable width, power, and bandwidth. We take the average 
time, average power, and the maximum bandwidth to properly 
convert bits/symbol to bits/s and bits/s/Hz. To improve upon 
the on-off keying solitons, first we continuously modulate one 
eigenvalue in a given region (i.e., a classical soliton with vary- 
ing amplitude, width and phase). We next consider multisoliton 
systems with a number of constellations on eigenvalues and 
discrete spectral amplitudes. 

Throughout this section, we consider a 2000 km single- 
mode single-channel optical link in which fiber loss is per- 
fectly compensated in a distributed manner using Raman 
amplification. Fiber parameters are given in Table |l] Dispersion 
compensation is not applied, as it is an advantage of the 
NFT approach that no dispersion management or nonlinearity 
compensation is required. We let pulses interact naturally, as 
atoms in a molecule, and perform signal processing at the 
receiver on these groups. The method however works for 
dispersion managed fibers as well, and in general with any 
operation that does not change integrability. 

A. Spectral Efficiency of 1 -Soliton Systems 

Soliton transmission systems typically do not have high 
spectral efficiencies. This is because the amplitude and the 
width of a single soliton are inversely related, and hence they 
require a lot of time or bandwidth per degree of freedom 
provided. Errors in a soliton transmission system occur either 
because of the Gordon-Haus timing jitter effect (which is the 
primary source of the errors, if not managed) [|3] or amplitude 
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(energy) fluctuations. It follows from Galilean invariance f5l 
that the Gordon-Haus effect exists for all kinds of pulses to the 
same extent and is not specific to solitons. This classical effect 
can be reduced with the help of suitably designed filters. We 
do not treat Gordon-Haus jitter analytically; however, system 
simulation do naturally include this effect. 

Let us first consider a classical soliton system with only one 
eigenvalue A = (a + ja;)/2. The joint density fA,n{ct,'^) at 
any fixed distance z can be obtained from (l24li-(|25]l (or by 
extracting the dynamics of a and oj from the stochastic NLS 
equation, resulting in a pair of coupled stochastic ordinary 
differential equations). 

Multiplying the stochastic NLS equation ( |22] | by q*, sub- 
tracting from its conjugate, integrating over time, and using 
integration by parts in the dispersion term, we get 



dE 
dz 



23? 



where E{z) — /|q(T, z)pdT is the energy, and Z = —n* 
is a noise process similar to n. Thus energy fluctuation 
is a signal-dependent Gaussian random variable E{z) ^ 
N'M{E{0),a^ J E{z)dz) w AfM{E{0),a^zE{0)) (E{0) > 
(T^). Ignoring the energy of the continuous spectrum, we have 
E ^ 2lu and therefore 



u{z)=u{0) + ad'-^NM{0,l) 



(27) 



The conditional probability density function (PDF) 

/oif2o(w|a;o) is 



1 



\nra^zuJo 



(28) 



and the PDF of r = ^/uj{z) is approximately a Rician 
distribution 



f[r\ro) = — e ^^/o( — ) 

1 Ir-rpf 



which is signal-independent in the high SNR regime. 

Note that we are interpreting the stochastic NLS equation 
as a soliton-bearing system in the small-noise limit. A soliton 
of the deterministic NLS equation launched into a system 
described by the stochastic NLS equation would, of course, 
have a growing continuous spectrum too; in addition, there 
would be a very small chance of creating additional solitons 
out of noise at some distance, or the soliton spectrum might 
collapse into the real axis in the A plane. All these effects are 
negligible if the energy of the launched soliton is large enough 
and the propagation distance is not exceedingly long. 

In lfT3l we have shown that a half-Gaussian density 

V 2Tr_r 

gives the asymptotic capacity for 
1 



where SNR = ^. 

Translating capacity in bits/symbol to spectral efficiency in 
bits/s/Hz depends on the receiver architecture. Assuming that 
the receiver is able to decode pulses with variable widths, the 
spectral efficiency piV) is obtained by 

p{r) = max ^ I{oj; un), (29) 

EP(c<.o) < 

where T{ujo) and P(a;o) are the width and the power of a 
single soliton with amplitude ujq and BW(S') = maxBW(a;o) 

is the maximum passband bandwidth that the signal set S 
requires for transmission (set by the min S). For a one soliton 
signal, we have approximately 



7 

Wo' 



^, BW(c^o) = 0.95l^o, (30) 



where the width T(wo) includes a guard time — four times 
the full width at half maximum power (FWHM) — so as to 
minimize the intra-channel interactions. 

Using dSOl l. the maximum spectral efficiency of a baseline 
on-off keying system is obtained to be about p « 0.15 
bits/s/Hz at the average power Pq = 0.16 mW. Note that 
the per unit cost capacity problem (l29T l is non-convex and 
hence finding the global optimum may prove to be challenging. 
Here we simply optimize mutual information and scale it 
by BW(S') X ET(a;o) evaluated at the mutual-information- 
maximizing input distribution. 

Fig. |7] shows the achievable rate and the spectral efficiency 
of a 1 -soliton system with amplitude modulation using various 
detection methods. Note that since we do not solve the opti- 
mization problem ( |29] |, the spectral efficiencies shown in the 
Fig. I3b) are only lower bounds on the actual achievable val- 
ues. Fig.|3c)-(d) shows the constellation at the transmitter and 
the "noise balls" (of radius equal to one standard deviation of 
the distance to the transmitted point) accumulated over 30,000 
simulation trials at the receiver The actual number of signal 
levels is 64 in the simulations. Calculation of the approximate 
rate is performed using the Arimoto-Blahut algorithm and is 
confirmed by numerical interior point optimization. 

Note that, as is clear from Fig. |3a)-(b), sampling signals 
at t = is clearly a bad idea; it is shown here just to see the 
effects of the timing jitter on 1— soliton systems. 

B. Spectral Efficiency of 2-Soliton Systems 

To illustrate how the NFT method works, we start off with 
two simple examples. These two examples are intended to 
explain the details of transmission and detection using the 
NFT, but they have not been optimized for performance. 

1 ) Modulating Eigenvalues: Consider the following signal 
set with 4 elements: 



2log(l + SNR)--, 



^1 

^3 



0, 

g(0.5j) = 1, 
g(0.25j) = 0.5, 
g(0.25j,0.5j) = (l,l). 



(31) 
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Table II VI shows the energy, duration, power and the bandwidth 
of these signals. 

We compare this with a standard on-off keying (OOK) 
soliton transmission system, consisting of 5*1 and 5*2. From the 
signal parameters given in Table HV] it follows that the OOK 
system provides about po — 0.33 bits/s/Hz spectral efficiency 
at Po = 0.1876 mW and Rq = 7.42 Gbits/s data rate. Note 
that the noise level is so small compared to the imaginary 
part of the eigenvalues that this scheme essentially achieves a 
transmission rate of 2 bits/symbol. 

The full constellation defined in (ISTT l has average power 
0.46Po and average time duration l.SSTi, where Pq and Ti 
are the power and the time duration of the fundamental soliton. 
The new signal set therefore provides a spectral efficiency 
of about X esTiWo ~ 1-2121 x pQ bits/s/Hz and operates at 
R = 1.2121 X Rq for about the same average power (0.5Po)- 
Note that without 5*4 the average power would be higher and 
in addition the improvement in the spectral efficiency would 

TABLE IV 

Parameters of the signal set in Section. [V-B II Here 
Eo= 4x0.5 = 2, To = 1.763 AT FWHM power, Ti = 5.2637 (99% 
ENERGY), Po = 0.38 AND Wo = 0.5714. The scale parameters are 
T^ = 25.246 ps and = 0.5 mW AT dispersion 0.5 ps/(nm — km). 



signal 


energy 


duration FWHM 


99% duration 


power 


bandwidth 


Si 





To 


Ti 





Wo 


S2 


Eo 


To 


Ti 


Po 


Wo 


Ss 


0.5 Eo 


2To 


2Ti 


0.25Po 


0.5Wo 


S4 


1.5 Eo 


4.25To 


2.58Ti 


0.5 8Po 


0.5Wo 



be slightly smaller compared to the on-off keying system. 
Signal 5*4 is the new signal (a 2-soliton) that goes beyond 
conventional methods. Such signals do not cost much in 
terms of time x maximum bandwidth product, while they add 
additional elements to the signal set. These additional signals 
can generally be best decoded with the help of the nonlinear 
Fourier transform. 

In this example, the receiver needs to estimate the pulse- 
duration. This can be done in many ways, e.g., using the 
NFT computations already performed: zeros of the signal in 
time can be detected when [vfe^^* ,V2e~^^^] reaches a constant 
value in steady state. This can be checked at times t — Ti, 
t = 2Ti and t — 2.58Ti. If one of the signals is zero at the end 
of another signal, one can monitor the energy of the continuous 
spectrum to make sure that it is small. If the symbol duration 
is fixed to be the maximum 2.58ri, the addition of S3 and 
S4 increases both time interval and cardinality of signal set 
such that the spectral efficiency and data rate remain constant 
(log(6)/2.58), while operating at 77% of the on-off keying 
signal power. 

Since solitons with purely imaginary eigenvalues do not 
suffer from major temporal or spectral broadening, spectral 
efficiencies at the fiber output are essentially the same as those 
at the input to the fiber. 

2) Modulating Both Eigenvalues and Spectral Amplitudes: 
We can improve upon the previous example by also modu- 
lating the spectral amplitudes. Consider the following signal 
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set 



St 

S2 — S5 
Se — Sg 
Sio — 5*16 



0, 

q{0.5j) = qi, 
q(0.25j) = 92, 
q(0.25i,0.5j)-(g3,<74) 



We make a 3-PAM constellation on qi E {0.5,1,1.5}. This 
creates a signal set with 16 elements. Here pulses are extended 
to 3Ti time duration. The resulting constellation has average 
power I.O6P0 and average time duration 2.236Ti, where 
Pq and Ti are the power and the symbol-duration of the 
benchmark on-off keying system. Therefore the new signal set 
provides about 2 23'6TiVq ~ ^''^^ ^ bits/s/Hz and operates 
at i? = 1.79 X Pq for about the same average power If 
we fix symbol durations to be the maximum 3Ti, then the 
improvement is p = 2.2po ~ 0.73 bits/s/Hz, at 80% of the 
average power 

Again, since the real part of the eigenvalues is not modu- 
lated, signals do not suffer from major temporal or spectral 
broadening. 

Remark 5. Note that modulating the eigenvalues includes only 
the amplitude information (similarly to M-ary frequency-shift- 
keying). To excite the other half of the degrees-of-freedom 
representing the phase, discrete spectral amplitudes should 
also be considered. While |g(Aj)| may be noisy, the phase 
Zq{Xj) or a function of {^(Aj)}^"^ can be investigated for 
this purpose. Note however that the asymptotic behavior of the 
spectral efficiency can be obtained, within a constant factor, 
by considering the eigenvalues alone. 

C. Spectral Efficiency of N-Soliton Systems, N > 3 

To achieve greater spectral efficiencies, a dense constellation 
in the upper-half complex plane needs to be considered. A 
spectral constellation with n possible eigenvalues in C+ (from 
which k eigenvalues are chosen, < fc < n) and m possible 
values for spectral amplitudes provides up to 



log E 



n log (to + 1) , 



bits per symbol (fewer if a subset is chosen). One can continue 
the approach presented in the previous examples by increasing 
n and to. The receiver architecture presented in [Part I] is fairly 
simple and is able to decode NFT signals rather efficiently. 

Some choices of spectral parameters may translate to pulses 
having a large peak to average power ratio, large (99%) 
bandwidth, or large (99%) time duration at z = or during 
their propagation; hence, the signal set should be expurgated 
to avoid such undesirable signals. We have not yet found rules 
for modulating the spectrum so that such undesirable signals 
are not generated. For the small examples given here, we can 
check pulse properties directly; however, appropriate design 
criteria for the spectral data (particularly the discrete spectral 
amplitudes) should be developed. 

In this simulation, we assume a constellation with 30 points 
uniformly chosen in the interval < A < 2 on the imaginary 
axis and create all A^-solitons, 1 < iV < 6. We then prune 



signals with undesirable bandwidth or duration from this large 
signal set. The remaining multi-solitons are used as carriers 
of data in the typical fiber system considered earlier Here 
a spectral efficiency of 1.5 bits/s/Hz is achieved. For this 
calculation, we take the maximum pulse width (containing 
99% of the signal energy) and the maximum bandwidth of 
the signal set. By increasing n and m, pulse widths get large 
and the shift of the signal energy from the symbol period, due 
to the Gordon-Haus effect, becomes less significant. Gordon- 
Haus effect for solitons is as important as it is for sine function 
transmission and backpropagation. 

We would note that the spectral efficiency reported here 
was achieved using only a rather simplistic design approach. 
We believe that a more sophisticated search over the design 
space, in particular exploiting the possibility of more cleverly 
modulating spectral ampUtudes and phase and choosing eigen- 
values in a region not limited to the juj axis, is likely to yield 
significantly higher spectral efficiencies. 

The recent work of ll28l also describes optical transmission 
schemes based on iV-soliton transmission and the inverse 
scattering transform, with reports on some achievable spectral 
efficiencies. 

D. Spectral Efficiencies Achievable by Modulating the Con- 
tinuous Spectrum 

In addition to the discrete spectrum, the continuous spec- 
trum can, in some cases, also be modulated. 

Here we consider the modulation of classical raised-cosine 
pulses, with 50% excess bandwidth. The continuous spectrum 
of an isolated raised-cosine pulse is purely continuous at low 
amplitudes, resembling its ordinary Fourier transform. We 
modulated the amplitude of an isolated raised-cosine pulse, 
propagated the pulse over a fiber channel, and estimated 
the continuous spectrum of the received signal. The received 
spectrum was then compared with the spectrum of all pos- 
sible transmitted waveforms at the transmitter using the log- 
Euclidean distance 

00 

d((72(A),gi(A)) = i j log (l + |g2(A)-gi(A)|') dA. 
—00 

Fig. [Sfa) shows the estimated rates achievable by a typical 
single-channel fiber-optic system, comparing detection after 
filtering, backpropagation, and matched-filtering, with detec- 
tion using the nonlinear Fourier transform. A multi-ring QAM 
modulation was used, with rings at 16 distinct amplitude 
values and 32 different phase values per ring. The complex 
plane was quantized into the Voronoi regions corresponding 
to the ring constellation, to induce a discrete channel, whose 
capacity was computed via the Blahut-Arimoto algorithm. The 
NFT is calculated at either 64 or 1024 uniformly spaced 
discrete points on the real axis over a range containing most 
of the pulse energy. The 1024-point NFT is compared with 
a 1024-point FFT implementation of the Split-Step Fourier 
Method in backpropagation. As can be seen, the NFT and 
backpropagation methods achieve approximately the same 
rates. The slight improvement in the NFT method can be 
attributed to the stability of the continuous spectral data 
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compared to the traditional time data (amplitude and phase 
at the output of a matched filter). 

This simulation was repeated for 5 WDM channels with 
the system architecture of Fig. [1] Here, low-pass filters and 
ROADMs are placed at the end of each fiber segment. The 
spectral efficiency in this case is obviously lower due to 
the inter-channel interference. Here too, NFT and FFT-based 
backpropagation produce approximately the same results. 

From Fig.|8]it follows that at low SNRs NFT and backprop- 
agation achieve about the same rates. As the SNR is increased, 
the spectral efficiency of backpropagation degrades due to ISI 
(if memory is not accounted for) or inter-channel interference 
(inevitably). We conjecture that the spectral efficiency of the 
NFT scheme will grow further due to its inherent immunity 
to cross-talk, provided that users are multiplexed appropriately 
in the nonlinear frequency domain. We have not yet simulated 
the spectral efficiency at SNRs beyond that of Fig. |8]due to the 
large numerical complexity involved. The potential advantage 
of NFDM thus remains to be illustrated. 

VI. Discussion 

In this section, we make a few remarks about the NFT 
method, clarifying some of its properties and potential practi- 
cal applications, as well as identifying some possible directions 
for future investigation. 

A. Multiuser Communications Using the NFT 

A potential gain is achievable in fiber-optic systems by 
employing methods, such as NFDM, which are less prone to 
inter-channel interference. 

Recall that in NFDM the real and imaginary axes of 
the complex plane correspond, approximately, to the signal 
frequency and energy. To use NFDM as a multiuser com- 
munication method, we can partition the complex plane into 
disjoint regions, e.g., vertical bins. Each bin can be assigned 
to a user and can contain one or more degrees-of-freedom. To 
multiplex user signals, traditional ROADMs must be replaced 
with nonlinear add-drop multiplexers (NADMs) which func- 
tion according to the NFT. In principle, each NADM would 
compute the spectrum of its input signals and filter the signals 
to be dropped in C+. It would then places the spectrum of 
the signals to be added in empty bands and produce the 
output signal by taking the inverse NFT. In this way, each 
user is assigned a region in the complex plane and does not 
interfere with other users (at least in the absence of noise — see 
Section IV^BT i. 

B. Noise in the Spectral Coordinates 

In a nonlinear interference channel, the interference can 
have two components. The first, termed "deterministic interfer- 
ence," arises from the (deterministic) interaction of the signals 
of other users with the user of interest, and in general is present 
even in the absence of noise. The second, termed "stochastic 
interference," arises from the (stochastic) coupling of noise 
with the signals of other users, interfering with the channel of 
interest. Thus, noise can affect the channel of interest directly 



(in-band noise) and indirectly (by introducing interference). 
Typically deterministic interference is stronger than stochastic 
interference. 

The NLS equation with additive noise has no known inte- 
grability structure, in the sense of possessing a set of non- 
interacting degrees-of-freedom. As a result, while the NFT 
method does not suffer from strong deterministic interference, 
a weak stochastic interference is expected to be present. In 
other words, even when users are multiplexed so that they 
do not interact in the absence of noise, the addition of 
noise can potentially re-introduce (stochastic) interference. In 
comparison, WDM with backpropagation is subject to both 
strong and weak interference. 

Finally, note that an additive (in-band) white noise in the 
time domain has coordinates in the spectral domain which may 
not be independent. Such correlations should be accounted for 
when designing signal detectors. 

C NFDM versus OFDM 

NFDM and OFDM are essentially identical in the absence 
of noise. However, as noted above, in cases where the presence 
of noise breaks the integrability structure, then, unlike OFDM, 
the signal degrees-of-freedom are not independent in NFDM. 

Note also that, while the ordinary Fourier transform of a 
signal is a function of a real variable, the NFT of a signal is 
generally defined on the whole complex plane, i.e., nonlinear 
frequencies are complex-valued. Since complex frequencies 
in the upper-half complex plane C+ are isolated points, this 
component in the support of the NFT can be modulated 
too. OFDM is conceptually in analogy with the continuous 
spectrum modulation, where only the spectral amplitudes are 
modulated. The discrete spectrum differs further from the 
continuous spectrum in that it is the frequencies in C''" 
themselves that contribute to the signal energy, and not their 
spectral amplitudes. 

The analogy between NFDM and OFDM can be better 
understood in fibers with negative dispersion parameter In 
this case, q* is replaced with g in (|5]l and the (noise-free) 
channel is still integrable. Here, similar to OFDM, the non- 
Unear frequencies are real and information is encoded only 
in spectral amplitudes. This case is appealing analytically and 
numerically, since the underlying L operator is Hermitian. 

D. Advantages and Disadvantages of NFDM 

a) Advantages: Some of the advantages of using NFDM 
were outlined in Section|l] In short, all deterministic distortions 
(SPM, XPM, FWM, ISI and interference) are zero for all users 
of a multiple-user network. 

b) Disadvantages: 

1) NFDM critically relies on the integrability of the chan- 
nel. Loss, higher-order dispersion, and other perturba- 
tions caused by filters and communication equipment not 
taken into account in this study contribute to deviations 
from integrability. There are, however, several reasons to 
believe that the overall channel from the transmitter to 
the receiver can still be close to an integrable channel: 
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Fig. 8. Achievable rates in (a) a single channel, and (b) a WDM optical fiber system using the nonlinear Fourier transform and baclcpropagation. The SNR 
is calculated at the system bandwidth and can be adjusted to represent the optical signal-to-noise ratio. 
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Fig. 9. Partitioning C"*" for multiuser communication using the NFT. 



a) Using Raman amplification, the effects of loss are 
minimal (indeed traded with noise perturbation). 

b) The NFT is applicable as long as the system can 
support soliton transmission. Solitons have been 
implemented in practice in the presence of commu- 
nication equipment (filters, multiplexers, analog-to- 
digital (A/D) converters, etc). This is an indicator 
that the overall channel is still nearly integrable. 

c) Mathematically one has stability results for solitons 
||29l . A soliton passing through a filter might be 
slightly distorted, but it re-organizes its shape so 
as to revert back to its original shape (or to form 
a soliton living nearby in the complex plane). 

d) Considering that the performance of the WDM 
method degrades asymptotically with SNR, it may 
be worthwhile to investigate NFDM, and identify 
and minimize the perturbations of integrability. 

2) The nonlinear Fourier coefficients at the receiver are cal- 
culated using 0{n) operations per nonlinear frequency, 
where n is the number of signal samples in time. In 
contrast with with the 0{n log n) complexity of the FFT, 
the complexity of an n-point NFT is, at present, 0{n^) 
(assuming a fixed number of Newton steps are needed 
in the case of discrete frequencies). The complexity 
of the transmitter can be even higher. As a result, the 
NFT is currently computationally difficult to implement 
or sometimes to simulate. It is therefore of interest to 
develop faster algorithms. 

3) Optical or electrical signal processing of A^-solitons 
and their required hardware (e.g., A/Ds) may not be 
as simple as those in linear systems. The NFT decoder 



typically requires signal samples at increments smaller 
than the Nyquist period. An interpolation step may be 
needed to find all the necessary data. 

E. Achievable Rates Using NFDM 

As mentioned earlier, a weak stochastic interference can 
be present in NFDM. As a result, although the achievable 
rate of NFDM is expected to be higher than that of WDM 
with backpropagation, it too may ultimately peak at some 
finite SNR, and decline at higher SNRs. We have not yet 
simulated the capacity at high SNRs to see when this may 
potentially happen. However, note that, regardless of the 
method of transmission, the noisy channel will fundamentally 
be interference-limited due to the lack of integrabilit}0. 

Note that the decline of the achievable rates in WDM 
simulations in prior work is mostly due to the (strong) de- 
terministic interference. For instance, capacity simulations in 
JH, lUni which show that the rates of the WDM method 
vanish at high powers even if noise is set to zero. This is 
not the case for multiuser NFDM whose achievable rate is 
unbounded when noise is absent. We expect improvements in 
the information rates using NFDM, due to the immunity to 
the (strong) deterministic interference. 

F. Eigenvalue Communication 

As noted in [Part I], the work of Hasegawa and Nyu on 
"eigenvalue communication" llT6ll is strongly related to the 
NFT-based approach taken in this paper. In lfT6l . the authors 

Since, for instance, the NLS equation with a generic additive potential 
does not have conserved quantities. 
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considered single-user channels, encoding information in con- 
served quantities, and use the inverse scattering transform 
(1ST) as a means to decode these conserved quantities. The 
idea is illustrated for pulses of the form Asech{t). 

The use of conserved quantities in a communication channel 
is desirable since it facilitates deterministic signal processing 
at the receiver and may simplify communication design. How- 
ever, fundamentally it does not offer any capacity improvement 
if one uses an equivalent set of non-conserved quantities {e.g., 
amplitude and phase). It is thus not necessary to aim at 
extracting and modulating quantities conserved by the channel. 

In contrast, our primary motivation for introducing NFT- 
based methods stems from recent capacity studies 10] and our 
observation, made in Section [III that inter-channel interfer- 
ence is the dominant rate-limiting impairment in multiuser 
fiber optic networks. The crucial property of the NFT is its 
capability to eliminate deterministic interference in multiuser 
networks. This observation was not made in [16|; indeed, the 
use of the 1ST as a solver, or a bridge to conserved quantities, 
does not provide any advantage compared to alternative signal 
processing methods such as digital backpropagation. 

G. The Discrete Nonlinear Fourier Transform 

To implement NFDM in practice, the discrete nonlinear 
Fourier transform, in which time domain signal is discrete 
and periodic, should be implemented. The development of the 
discrete nonlinear Fourier transform exists in the mathematics 
Uterature, taking a similar form to the continuous one [|18J . 

ED. 

VII. Conclusions 

Motivated by recent studies showing that the achievable 
rates of current methods in optical fiber networks vanish 
at high launch power due to the impact of nonlinearity, in 
[Part I], [Part II], and this paper we have revisited information 
transmission in such nonlinear systems. In these papers, we 
suggested using the nonlinear Fourier transform to transmit 
information over integrable communication channels such as 
the optical fiber channel, which is governed by the nonlinear 
Schrodinger equation. In this transmission scheme, informa- 
tion is encoded in the nonlinear Fourier transform of the 
signal, consisting of two components: a discrete and a con- 
tinuous spectral function. With this new method, deterministic 
distortions arising from the dispersion and nonlinearity, such 
as inter-symbol and inter-channel interference are zero for a 
single user channel or all users of a multiuser network. 

We took the first steps towards the design of a communica- 
tion system implementing the nonlinear Fourier transform. We 
proposed a Darboux-transform-based algorithm for modulat- 
ing the discrete spectrum at the transmitter, and we provided 
a first-order perturbation analysis of the influence of noise on 
the received spectrum. Furthermore, we provided examples 
illustrating how the NFT can be used for data transmission. 
Although these small examples clearly demonstrate improve- 
ments over their benchmark systems, more sophisticated large- 
scale simulations are required to demonstrate the potential to 
achieve high spectral-efficiencies. 



Because nonlinearity is a key feature of fiber-optic net- 
works, the development of nonlinearity-compatible transmis- 
sion schemes, like those based on the nonlinear Fourier trans- 
form, is likely to continue to be a fruitful research direction. 

Appendix A 
Solution of Hirota Equations 

Because (fT2l) and ( fTsT l are homogeneous in the order of 
derivatives that occur in each term, exponential functions 
are candidate solutions. This suggests that F and G can be 
expanded (linearized) as 

F{t, z) = fit, z) + e/(i)(t, z) + eV^'' (i, ^) + • ' • , 
G{t, z) = g\t, z) + eg^^\t, z) + e^g^^\t, z) + ■ ■ ■ , 

for some small parameter e. We then obtain 

FG = /(°V°)+e(/(%W+/W5W) 
+ £2 (/(").g(2) + /(i).g(i) + /(2)g(0) 
p2 ^ (^(o))2 ^ 2e/(o) /(I) + £2 (^2 + + . . . , 

+ 6^ (.g(V+.g^V°^' + |.gW|2 

which can then be substituted into (fT2] |-(fT3]l. Equating like 
powers of e, one obtains bilinear equations for sub-components 
j(OyO) using the D operator As shown below (for the NLS), 
this series truncates for integrable systems and exact solutions 
of various finite order are obtained. To begin finding unknowns 
recursively, we can set initially Z*^*^) = 1. 

The zero-order term in (ITZt gives j.gi°'' + g^J''' = 0, i.e., 
g{0) ^ ^gXo^ where Xq = 2jXot - k^z + (p^. Here Aq = 
(a + jw)/2 is the eigenvalue, fco — 4^^^, and (/)o is a constant 
phase related to the spectral amplitude ((/)o = log(g/a;2)). The 
zero-order term in (ITjt gives c — Q, and thus g^'^^ — 0. At this 
stage, we can set the higher order terms to zero and get the 
trivial solution g = 0. 

The e term in (fTSl i gives g^^^ = ae^^. The corresponding 
term in (Ql gives 20} = 0, i.e., /(i) = 0. 

The e2 term in (US is D'lf^'^ = 2\g^^^\'^, or 2/^^^^ = 
2|5(i)|2. Trying = ae^^+^'^ , we get a = 
At this stage, we can set other higher order unknowns to zero 
and get the one soliton solution (choosing e = 1) 



F{t,z) = 14 
G{t,z) = 
q{t,z) 



1 



4(A- A*)2 



1 



4(A-A*)2 
= wsech [w {t - 2az - to)] eJ"*-J("'-"')^, 

where <o = ^og{\q\ / uj) / uj . 

We can continue to find higher order solitons. For instance, 
the £2 term in (O, i.e., {jD^ + Dl)l ■ g2 = 0, gives 52 = 
a2e-^^, and so on. 
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Appendix B 
Proof of the Darboux Theorem 

See fT9l for a more general theorem and its proof; here we 
give a simple proof for Theorem [1] 

Let (j){t, A; q) be a known eigenvector associated with A 
and q, i.e., satisfying (j)t = P{q.X)(t>. Its adjoint <f>{t,X) ~ 
[02,-</'i] satisfies 0t(i, A) = P{q, X*)(t>{t, X). Denote this 
known solution as = [0,0], T = diag(A, A*), and E — 

We can verify that St = JST + QS, where J = diag(j, -j) 
and Q = offdiag(g, — g*). In addition we have — [JE + 
Q,E]. 

Given that (j){t, A; g) is known, the Darboux transformation 
maps {v{t,fj.;q),v{t,fi;q)} to {u{t, fi; q),u{t, q)} accord- 
ing to 

u ^VA- j:v, 

where V — [v, v], U = [u, u], A = diag(/i, fi*). 
We have Vt = JVA + QV and 

Ut^VtA-{i:tV + i:Vt) 

= {JVA + QV)A - ([JE + Q, E]F + E( JFA + Ql/)) 
= {JVA + QV)A - EJt/A - {[JE + Q, E] + EQ} V" 
= J{VA - E1/)A + JEFA - E JFA 

+ QV^A - ([JE + Q, E] + Eg) F 
= JUA + [J, T.]VA - ([JE + Q, E] + Eg) + QVA 
= JUA + [J, E]l/A - ( JE^ + gE - E JE) V + QVA 
= JUA + [J, E]l/A - [J, E]E1/ - gEV" + g^A 

= JUA + [J, T,]{vA - EF) - gEv^ + gv^A 

= JUA + [J, E][/ + g(\/A - EV^) 
= JC/A+(g+[J,E])C/ 
= JC/A + gj7, 

where g = g + [J, E]. 

In the same manner we can show that u and u satisfy the 
M-equation Vz — M (A, q) and vt = M {X* ,q). 
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